Submitted to IEEE Transactions on PAMI

Authors

  • Kamran Etemad
  • David S. Doermann
  • Rama Chellappa
Abstract

A new algorithm for layout-independent document image segmentation is suggested. Text, image and graphics regions in a document image are treated as three different "texture" classes. Feature vectors based on a multi-scale wavelet packet representation are used for local classification. Segmentation is performed by propagating soft local decisions made on small windows across neighboring blocks and integrating them to reduce their "ambiguities" and increase their "confidence" as more contextual evidence is obtained from the image data. Local votes propagate in a neighborhood, within and across scales, and majorities of weighted votes give the final decisions. The method has been tested on document page decomposition tasks, and the results of these tests are presented. The algorithm is general, can be applied to other segmentation and classification tasks, is based on parallel, distributed and independent computations, and has low complexity.
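
The vote-propagation idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names (`propagate_votes`, `final_labels`), the 4-neighbor averaging rule, the `neighbor_weight` and `n_iter` parameters, and the single-scale grid are all assumptions chosen for illustration. The abstract also describes propagation across scales and wavelet packet features, which are omitted here. Each block starts with a soft score per class, repeatedly blends its score with the mean of its neighbors, and the final label is the class with the largest accumulated weight.

```python
import numpy as np

# Class order is an assumption made for this sketch.
CLASSES = ("text", "image", "graphics")


def propagate_votes(soft, n_iter=3, neighbor_weight=0.5):
    """Blend each block's soft class scores with those of its 4 neighbors.

    soft: array of shape (H, W, 3); each block holds non-negative class
          scores that sum to 1 (hypothetical output of a local classifier).
    Returns an array of the same shape with the propagated scores.
    """
    votes = np.asarray(soft, dtype=float).copy()
    for _ in range(n_iter):
        # Replicate border blocks so every block has four neighbors.
        padded = np.pad(votes, ((1, 1), (1, 1), (0, 0)), mode="edge")
        neighbors = (
            padded[:-2, 1:-1]    # block above
            + padded[2:, 1:-1]   # block below
            + padded[1:-1, :-2]  # block to the left
            + padded[1:-1, 2:]   # block to the right
        ) / 4.0
        # Weighted combination of a block's own vote and its neighbors' votes.
        votes = (1.0 - neighbor_weight) * votes + neighbor_weight * neighbors
        # Renormalize so each block's scores still sum to 1.
        votes /= votes.sum(axis=-1, keepdims=True)
    return votes


def final_labels(votes):
    """Pick, per block, the class index with the largest weighted vote."""
    return votes.argmax(axis=-1)


if __name__ == "__main__":
    # A 5x5 grid of confidently "text" blocks with one ambiguous center block.
    soft = np.tile(np.array([0.7, 0.2, 0.1]), (5, 5, 1))
    soft[2, 2] = [0.3, 0.4, 0.3]
    labels = final_labels(propagate_votes(soft))
    print(CLASSES[labels[2, 2]])
```

In this toy grid the ambiguous center block initially leans toward "image", but after propagation its neighbors' evidence dominates and it is labeled "text". Because every block is updated from its immediate neighborhood only, the update is local and can run in parallel over the whole grid.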

Similar resources

Classified JPEG coding of mixed document images for printing

This paper presents a modified JPEG coder that is applied to the compression of mixed documents (containing text, natural images, and graphics) for printing purposes. The modified JPEG coder proposed in this paper takes advantage of the distinct perceptually significant regions in these documents to achieve higher perceptual quality than the standard JPEG coder. The region-adaptivity is perform...

Adaptive transforms for image coding using spatially varying wavelet packets

We introduce a novel, adaptive image representation using spatially varying wavelet packets (WPs). Our adaptive representation uses the fast double-tree algorithm introduced previously (Herley et al., 1993) to optimize an operational rate-distortion (R-D) cost function, as is appropriate for the lossy image compression framework. This involves jointly determining which filter bank tree (WP freq...

Wavelet packet image coding using space-frequency quantization

We extend our previous work on space-frequency quantization (SFQ) for image coding from wavelet transforms to the more general wavelet packet transforms. The resulting wavelet packet coder offers a universal transform coding framework within the constraints of filterbank structures by allowing joint transform and quantizer design without assuming a priori statistics of the input image. In other...

A low-complexity region-based video coder using backward morphological motion field segmentation

We introduce a novel region-based video compression framework based on morphology to efficiently capture motion correspondences between consecutive frames in an image sequence. Our coder is built on the observation that the motion field associated with typical image sequences can be segmented into component motion subfield "clusters" associated with distinct objects or regions in the scene, and...

Image compression with a hybrid wavelet-fractal coder

A hybrid wavelet-fractal coder (WFC) for image compression is proposed. The WFC uses the fractal contractive mapping to predict the wavelet coefficients of the higher resolution from those of the lower resolution and then encode the prediction residue with a bitplane wavelet coder. The fractal prediction is adaptively applied only to regions where the rate saving offered by fractal prediction j...

Publication date: 1997